Statistical Inference for Streamed Longitudinal Data

نویسندگان

چکیده

Summary Modern longitudinal data, for example from wearable devices, may consist of measurements biological signals on a fixed set participants at diverging number time-points. Traditional statistical methods are not equipped to handle the computational burden repeatedly analysing cumulatively growing dataset each time new data collected. We propose estimation and inference framework dynamic updating point estimates their standard errors along sequentially collected datasets with dependence, both within between datasets. The key technique is decomposition extended function vector quadratic constructed over cumulative into sum summary statistics batches. show how this can be recursively updated without need access whole dataset, resulting in computationally efficient streaming procedure minimal loss efficiency. prove consistency asymptotic normality our estimator as batches diverges, even independent remains fixed. Simulations demonstrate advantages approach traditional that assume independence Finally, we investigate relationship physical activity several diseases through analysis accelerometry National Health Nutrition Examination Survey.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Nonparametric Regression for Longitudinal Data

In many area of medical research, a relation analysis between one response variable and some explanatory variables is desirable. Regression is the most common tool in this situation. If we have some assumptions for such normality for response variable, we could use it. In this paper we propose a nonparametric regression that does not have normality assumption for response variable and we focus ...

متن کامل

Exploring New Statistical Methods for Causal Inference in Longitudinal Studies

This study proposes a novel method to provide unbiased effect estimates in the presence of time-dependent confounding and applies the method in an existing merged multi-source nursing home dataset to examine the effect of antipsychotic medication use on all cause mortality. Using standard methods, effect estimates of time-varying exposures from observational data will be biased in the presence ...

متن کامل

Simultaneous Statistical Inference for Epigenetic Data

Epigenetic research leads to complex data structures. Since parametric model assumptions for the distribution of epigenetic data are hard to verify we introduce in the present work a nonparametric statistical framework for two-group comparisons. Furthermore, epigenetic analyses are often performed at various genetic loci simultaneously. Hence, in order to be able to draw valid conclusions for s...

متن کامل

Statistical Inference for Large Scale Data

We introduce a sparse and positive definite estimator of the covariance matrix designed for high-dimensional situations in which the variables have a known ordering. Our estimator is the solution to a convex optimization problem that involves a hierarchical group lasso penalty. We show how it can be efficiently computed, compare it to other methods such as tapering by a fixed matrix, and develo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Biometrika

سال: 2023

ISSN: ['0006-3444', '1464-3510']

DOI: https://doi.org/10.1093/biomet/asad010